Abstract. This work presents an empirical study of the evolution of the personal income distribution in Brazil. Yearly samples available from 1978 to 2005 were studied and evidence was found that the complementary cumulative distribution of personal income for the 99% economically less favored part of the population is well represented by a Gompertz curve of the form $G(x) = \exp[\exp(A - Bx)]$, where $x$ is the normalized individual income. The complementary cumulative distribution of the remaining 1%, the richest part of the population, is well represented by a Pareto power law distribution $P(x) = \beta x^{-\alpha}$. This result means that, similarly to other countries, Brazil's income distribution is characterized by a well defined two class system. The parameters $A$, $B$, $\alpha$, $\beta$ were determined by a mixture of boundary conditions, normalization and fitting methods for every year in the time span of this study. Since the Gompertz curve is characteristic of growth models, its presence here suggests that these patterns in income distribution could be a consequence of the growth dynamics of the underlying economic system. In addition, we found that the percentage share of both the Gompertzian and Paretian components relative to the total income shows an approximate cycling pattern with periods of about 4 years, with the maxima and minima of each component alternating about every 2 years. This finding suggests that the growth dynamics of Brazil's economic system might follow a Goodwin-type class model based on the application of the Lotka-Volterra equation to economic growth and cycle.
1 Introduction

The attempt to apply methods of physics to describe various features of societies has a long history, stretching as far back as Thomas Hobbes and William Petty [3, 4]. Although the potential for misapplications is far from negligible [4, 58], over the past two decades tools, methods and ideas originally developed to understand the fabric of the physical universe have been increasingly applied by physicists to describe and understand the inner workings of societies [13, 17, 22, 30, 53, 55, 67, 68]. What started simply as an exercise in statistical mechanics, where complex behavior arises from simple rules through the interaction of a large number of components, has grown steadily with the increasing interest of physicists in interdisciplinary research, and the area known today as socio-economic physics, or sociophysics and econophysics for short, was born in the late 1990s [5, 10, 20, 27, 79]. As a consequence, old problems in what until recently was believed to be the exclusive realm of economics are receiving fresh attention in econophysics, and possible new perspectives and solutions are emerging.

Our goal here is to focus on one of those old problems, namely the work done over a century ago by the Italian economist and sociologist Vilfredo Pareto [64], who studied the personal income distribution for some countries and years. He found that the complementary cumulative personal income distributions followed a power law for those with high income |@] p. 245], [52, p. 152], a result that later came to be considered a classic example of a fractal distribution ifSTl p. 347], [62]. Later results confirmed Pareto's findings, but the application of his personal income power law, also known simply as the Pareto law B4 11621, is limited to the very high income population (see below). The overwhelming majority of the population does not follow Pareto's power law distribution and, therefore, the characterization and understanding of the personal income distribution of the economically less favored remains an open problem.

There have been several recent studies of individual income distribution for different countries and epochs: modern, medieval and even ancient. For old societies, these studies include ancient Egypt [1] and medieval Hungary around 1550 [39]. A list of recent studies for modern societies carried out by both economists and econophysicists, which should by no means be considered exhaustive, includes Australia [6, 19], Brazil ifTfjl, China [12], France llrSBI, Germany [65], India [69], Italy [13, 65], Japan [12, 28, 40, 72, 73, 74], Poland [21, 58], United Kingdom [25, 38, 65, 76] and USA [8, 13, 14, 24, 25, 48, 50, 76, 79].
The results coming out of these studies are varied. Although most of them confirm the validity of the Pareto law for higher personal incomes, the characterization of the lower individual income distribution remains disputed. Gaussian, log-normal, gamma, generalized beta of the second kind, Fisk and Beaman distribution functions have been used to fit the data, as well as Dagum, Singh-Maddala and Weibull models [7, 8, 12, 38, 41, 47, 48]. Recently the exponential was found to produce a good description for about 98% of the population at the lower personal income portion [6, 13, 14, 23, 24, 25, 26, 48, 76, 79].

Disparate interpretations for these distributions have also been advanced. Many interpretations are basically of a statistical nature, invoking stochastic processes [6, 24, 41, 62, 66, 71]. Others attempt to draw analogies from physics. This is the case of Dragulescu and Yakovenko [24, 25, 78, 79], who advanced an exponential type distribution of personal income analogous to the Boltzmann-Gibbs distribution of energy in statistical physics, and Chatterjee et al. [11], who proposed an ideal-gas model of a closed economic system where total money and number of agents are fixed.

The purpose of this paper is to study the personal income 
distribution of Brazil for approximately the last 30 years. Here 
we provide empirical evidence which confirms that Brazil also 
follows the Pareto law for the tiny group which constitutes the 
high personal income population. The other motivation of this 
paper was to try to determine whether or not the exponential is 
as good a descriptor for the Brazilian data as it is for the USA. 
Our results show that the exponential and, by extension, any 
function based on it, turned out to be a very poor descriptor 
of the lower income distribution in Brazil. Such a result led us 
to search for another simple function capable of describing the 
individual income distribution for the majority of the Brazilian 
population. We propose here the Gompertz curve [36, 45, 77]
as a good descriptor for the distribution of the lower income 
population. Although the Gompertz curve can be written with 
two parameters only, we shall show below that one of them can 
be linked to a boundary condition determined by the problem. 
This effectively leaves only one parameter to be fitted by the 
data. Therefore, here we provide empirical evidence that the 
personal income distribution in Brazil reasonably follows the 
Gompertz curve for the overwhelming majority of the popula- 
tion. 

Our results show that the individual income distribution 
data in Brazil from 1978 to 2005 are well described by both 
the Pareto law and the Gompertz curve. This time span con- 
stitutes virtually all data for the Brazilian individual income 
distribution available in digital form at the time of writing. We 
have calculated the parameters of both curves with their uncertainties
for all years in this period, with the exception of those
when there was no data collection: 1980, 1991, 1994, 2000 (see
Section 2 below). We also present the Lorenz curves, the Gini
coefficients and the evolution of the Pareto index, that is, the 
exponent of the Pareto law, in this time span as well as a com- 
parison of the income share for the two groups, showing an 
approximate cycle with roughly a 4 year period. As it happens 
for other countries, we found evidence that the lower income 
population, represented here by a Gompertz curve, constitutes 
about 99% of the Brazilian population, with the remaining 1% 
richest being represented by a Pareto power law distribution.
Similarly to other countries, such results characterize Brazil as
being a well defined two income class system.

The plan of the paper is as follows. Section 2 presents the income data of Brazil and discusses how the data reduction necessary for our analysis was carried out. Some results obtained directly from the data, such as the Lorenz curves and Gini coefficients, are also shown. Section 3 presents our analytical modeling by means of the Gompertz curve and Pareto power law complementary cumulative distribution functions. The results are presented in Section 4, where one can find various tables presenting the fitted parameters and plots showing the linearization of both the Gompertz and Pareto income regions with their fitted lines, as well as the evolution of the Paretian component income share relative to the overall income. Section 5 summarizes and discusses the results.



2 The Data 

Personal income data for the Brazilian population is available 
in yearly samples called PNAD. This is a Brazilian Portuguese 
acronym meaning "National Survey by Household Sampling." 
IBGE, the Brazilian government institution responsible for data 
collection, formatting and availability, carries out the survey 
every September and the data is released usually about one 
year later. PNAD data has been systematically available dig- 
itally since 1978, although in 1980, 1991, 1994 and 2000 there 
was no data collection and, therefore, there are no PNADs for 
these years. IBGE also has digital PNAD data for 1972, but the 
file seems incomplete and without clear labels for each entry. 
In addition the 1972 data collection was apparently carried out 
by a very different methodology than the one adopted by IBGE 
from 1978 onward. For these reasons we considered the 1972 
PNAD data unreliable and discarded it from our analysis. 

PNAD comprises surveys of about 10% of households in Brazil. The released data consists of files with entries for each surveyed household, providing the household's total income, the number of people living in it, a weight index representing its proportion relative to the complete set of households in Brazil, the occupation of those individuals and many other entries which are not relevant for the present analysis. PNAD is a sample, not a census, and the surveyed households' locations in Brazilian territory are carefully selected by IBGE such that, once the weight index is used, the final set should be very close to the complete real set.

The most appropriate procedure to find the personal income from our data set would be to adopt some sort of "equivalence scale", that is, a tool allowing us to reach conclusions about how the total income in a household is shared among all of its members. One way of doing this is to allocate points to each individual in a household, such that the first adult would have a higher weight than other persons whose ages are, say, 14 years or older. Children under the age of 14 would be allocated an even smaller weight. For instance, the first adult would have a weight of 1 point, additional persons aged 14 years or older would have 0.5 points and children would be allocated 0.3 points. The idea behind this procedure is to differentiate the household members who consume but do not produce income (children, for instance) from those who do both, but at different levels, and also to take into account the fact that there are goods in a household which are consumed by several individuals at the same time, such as washing machines and kitchens, so that a second adult would not consume as much as the first and would contribute more to raising the household's well-being. Using this procedure the income of children under 14 years would be near zero, even though they share the household's total income. Equivalized household income would then be obtained by dividing the total household income by the sum of the points attributed to the household members [18].
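
As a rough illustration of the point-based scale just described, the sketch below divides a household's income by the sum of its members' points. The function name and the sample household are hypothetical; only the weights 1, 0.5 and 0.3 come from the text.

```python
def equivalized_income(total_income, n_adults, n_children,
                       first_adult=1.0, other_adult=0.5, child=0.3):
    """Divide household income by the sum of equivalence points.

    n_adults counts persons aged 14 or older, n_children those under 14.
    """
    if n_adults < 1:
        raise ValueError("a household needs at least one person aged 14 or older")
    points = first_adult + other_adult * (n_adults - 1) + child * n_children
    return total_income / points


# Hypothetical household: two adults and two children sharing 2300 units.
# points = 1.0 + 0.5 + 2 * 0.3 = 2.1
print(round(equivalized_income(2300.0, 2, 2), 2))
```

A single adult living alone keeps the full household income, since the sum of points is then exactly 1.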

The major obstacle we faced in implementing such a differentiated equivalence scale with our data is the fact that the PNADs do not provide us with enough information to do so. What we have is a list of the total income in a household and the number of people living in it. Under these circumstances we adopted an equivalence scale such that each individual is allocated a weight of 1 point. So, for each PNAD entry we divided the total income by the number of people living in the household, meaning that the household income is equally divided among all occupants.

As mentioned above, each PNAD household entry has a supplied weight index corresponding to its relative importance, or representation, as regards the entire country. This means that although the survey comprises only a portion of Brazilian households, once we obtain the income of each individual in a particular home we multiply the number of individuals in that home by this weight in order to obtain the number of individuals with that particular income in the whole country. Thus, we end up with tables relating on one side a certain number of individuals and on the other their respective incomes.
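
The per-capita division and the weighting step can be sketched as follows. This is only an illustration under stated assumptions: the record layout `(total_income, n_people, weight)` is hypothetical, not the actual PNAD file format, and the weight is assumed to count how many households in the country each surveyed household represents.

```python
from collections import defaultdict


def income_table(households):
    """Map per-capita income -> estimated number of individuals nationwide.

    Each record is (total_income, n_people, weight). The layout is
    hypothetical; `weight` is assumed to be the number of households
    represented by the surveyed one.
    """
    table = defaultdict(float)
    for total_income, n_people, weight in households:
        if n_people <= 0:
            continue  # skip malformed entries
        per_capita = total_income / n_people  # every occupant gets 1 point
        table[per_capita] += n_people * weight
    return dict(table)


# Made-up sample of three surveyed households.
sample = [(1200.0, 4, 150.0), (900.0, 3, 200.0), (500.0, 1, 80.0)]
table = income_table(sample)
print(table)
```

Households with the same per-capita income are merged into a single row, which gives the two-column table of individuals versus income described above.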

Brazil experienced runaway inflation and hyperinflation for most of the 1980s and early 1990s, resulting in a series of currency adjustments where many zeros were "dropped" from time to time and new currency names were adopted each time those adjustments became effective. Hyperinflation came to an abrupt end in 1994 when a new and stable currency, called real (R$), was adopted. This fact required the adoption of a methodology to homogenize the final data, otherwise comparison of data sets of different years would be problematic. Thus, we adopted the procedure of normalizing the income values by the average income of September of each year. In other words, let $x'_i$ be the $i$th income received in the month of September of a certain year, given in one of the Brazilian currency units legally adopted in the country when the survey was carried out. Then $\langle x' \rangle$ is the average income value during the month of September of that particular year. We may now define the normalized individual income $x_i$ to be the ratio $x_i = x'_i / \langle x' \rangle$, so that $x_i$ becomes currency independent. In this way we were able to produce tables listing the number of people in terms of currency free income values. This allowed us to generate distribution functions relative to the average personal income in a certain year. This individual average income does change from year to year, as can be seen in table 1, where the currency names, exchange rates and the average individual incomes in September of each year are presented.
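
A minimal sketch of this normalization, assuming the data is held as a hypothetical `{income: number_of_individuals}` table and that the average is weighted by the number of individuals (the text does not spell out the weighting):

```python
def normalize_incomes(table):
    """Rescale a {income: n_individuals} table by the weighted mean income.

    Returns the normalized table and the mean income used for the rescaling.
    """
    population = sum(table.values())
    mean_income = sum(x * n for x, n in table.items()) / population
    normalized = {x / mean_income: n for x, n in table.items()}
    return normalized, mean_income


# Made-up incomes in some currency unit.
table = {100.0: 50.0, 200.0: 30.0, 700.0: 20.0}
normalized, mean_income = normalize_incomes(table)
print(mean_income)   # weighted mean of the raw incomes
print(normalized)    # incomes expressed in units of the mean
```

By construction the normalized incomes have a weighted mean of 1, whatever currency the raw values were expressed in.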

Our next step was then to divide the data into bins, inasmuch as most data is clumped towards low income values. The data binning methodology adopted here is the standard one used for problems involving power law determination [62] and which was previously used by these authors to derive the Zipf law for Brazilian cities [60]. The method consists of taking logarithmic binning such that bins span increasingly larger intervals and every step is 10% larger than the previous one. This is accomplished according to the rule below,

$$x_j = 1.1^{(j-1)} \, x_{\min}. \qquad (1)$$

By following this procedure we were able to create for each year a sample of $n$ observed values such that,

$$\{x_j\}: \quad (j = 1, \ldots, n), \quad (x_1 = x_{\min}), \quad (x_{\min} \approx 0.01), \quad (n \approx 100).$$

The purpose of this methodology is to achieve a sharp decrease in the statistical fluctuations in the tail, since bins with far fewer observed values, prevalent in the tail of the distributions, are prone to large fluctuations. This effect has the potential of creating a serious bias in the determination of the parameters by least squares fitting [62]. To counteract this problem, it is known that an appropriate logarithmic binning is very effective at severely reducing the uneven variation in the tail, which means that the possible bias in the parameter determination by least squares fitting [35] is, therefore, strongly reduced.
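
The binning rule of equation (1) can be sketched as below. The helper names are hypothetical, and treating the $x_j$ as left bin edges, with the last bin left open-ended, is an assumption made for illustration; the text does not say how the bin boundaries were handled.

```python
import bisect


def log_bin_edges(x_min=0.01, n=100, ratio=1.1):
    """Bin edges x_j = ratio**(j-1) * x_min for j = 1..n, as in equation (1)."""
    return [x_min * ratio ** (j - 1) for j in range(1, n + 1)]


def bin_counts(table, edges):
    """Sum the individuals of a {income: count} table falling in [x_j, x_{j+1}).

    Assumption: the x_j are left edges and the last bin is open-ended.
    """
    counts = [0.0] * len(edges)
    for x, n_people in table.items():
        if x < edges[0]:
            continue  # below x_min, e.g. zero incomes
        j = bisect.bisect_right(edges, x) - 1  # index of the last edge <= x
        counts[j] += n_people
    return counts


edges = log_bin_edges()
counts = bin_counts({0.4: 50.0, 0.8: 30.0, 2.8: 20.0}, edges)
print(len(edges), edges[0], edges[-1])
```

With $x_{\min} = 0.01$ and $n = 100$ the last edge lies above 100 times the average income, so the same grid covers both the bulk of the population and the sparse tail.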

After the steps described above were taken we were able to obtain cumulative probabilities by calculating the number of individuals whose income goes up to certain values and dividing this value by the total number of individuals. The final results are shown in figures 1 and 2, where complementary cumulative probabilities are plotted against normalized income for each year of the studied time span. It is clear from these graphs that there are enough points to form an almost continuous and smooth curve. Therefore, from now on we will change the discrete variable $x_j$ to the continuous independent variable $x$ representing the normalized individual income values.
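
For concreteness, the complementary cumulative probabilities can be obtained from binned counts as in the sketch below. The function name and the sample counts are made up, and percentages are used so that the largest value is 100.

```python
def complementary_cumulative(counts):
    """Percentage of individuals whose income is at or above each bin edge.

    counts[j] is the number of individuals in bin j, ordered by income.
    """
    total = sum(counts)
    ccdf, remaining = [], total
    for n_people in counts:
        ccdf.append(100.0 * remaining / total)
        remaining -= n_people
    return ccdf


# Made-up counts for four bins of increasing income.
counts = [10.0, 50.0, 35.0, 5.0]
print(complementary_cumulative(counts))  # [100.0, 90.0, 40.0, 5.0]
```

The first entry is always 100, because every individual has an income at or above the lowest bin edge.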

The data obtained with the procedures outlined above allowed us to calculate the so-called Lorenz curve [41, 46], which measures the degree of inequality in income distribution, by setting the maximum income value to 100% and then calculating the percentage of individuals who receive a certain percentage of the maximum income. Figures 3 and 4 show the Lorenz curves for Brazil from 1978 to 2005.

Once the points forming the Lorenz curves had been calculated we were able to obtain the corresponding Gini coefficients [32, 33, 34, 41], which measure the inequality of the income distribution. This was done by numerically calculating the area below the Lorenz curves. Figure 5 shows the results.
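
One possible way to carry out these two steps numerically is sketched below. It assumes the standard relation between the Gini coefficient and the Lorenz curve, namely one minus twice the area under the curve when both axes run from 0 to 1, and it uses the trapezoidal rule; the text only states that the area was computed numerically, so both choices are assumptions.

```python
def lorenz_curve(table):
    """Cumulative population and income shares from a {income: count} table."""
    items = sorted(table.items())  # poorest first
    population = sum(n for _, n in items)
    total_income = sum(x * n for x, n in items)
    pop_share, inc_share = [0.0], [0.0]
    cum_pop = cum_inc = 0.0
    for x, n in items:
        cum_pop += n
        cum_inc += x * n
        pop_share.append(cum_pop / population)
        inc_share.append(cum_inc / total_income)
    return pop_share, inc_share


def gini(pop_share, inc_share):
    """Gini = 1 - 2 * (trapezoidal area under the Lorenz curve)."""
    area = 0.0
    for k in range(1, len(pop_share)):
        width = pop_share[k] - pop_share[k - 1]
        area += 0.5 * width * (inc_share[k] + inc_share[k - 1])
    return 1.0 - 2.0 * area


# Made-up table: half of the population earns nothing, half earns 2 units.
pop, inc = lorenz_curve({0.0: 1.0, 2.0: 1.0})
print(gini(pop, inc))
```

A perfectly equal distribution gives a Lorenz curve on the diagonal and a Gini coefficient of zero.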



3 Modeling the Individual Income Distribution 

Anyone attempting to become familiar with the recent literature in econophysics will see that when physicists try to solve problems traditionally dealt with by economists they do so from a different perspective. That, of course, will be no different for the income distribution problem. We therefore believe it fruitful to set out our viewpoints on how to approach the income distribution problem at the very beginning of our discussion. So, this section will start by outlining our modeling perspective and how it differs from the traditional approach followed by economists.

Table 1. Currencies in Brazil from 1978 to 2005 and the average individual income $\langle x' \rangle$ calculated in September of a given year. $\langle x' \rangle$ is converted by the exchange rate of September 15th of each year and presented in US dollars of that particular day (source: Brazil Central Bank). The hyperinflation period is clearly visible in the evolution of the exchange rate.

The first aspect worth mentioning is that economists often fit their income distribution data by means of complex single functions with as many parameters as necessary [7, 8, 12, 21, 50, 65]. Fitting the whole dataset with single functions with four or more parameters may produce a better data fit, but the drawback is that this kind of fitting does not give better insight into the problem. The paramount objective of any physical modeling is to find the differential equations which describe the observed empirical pattern and, therefore, data fitting is only the very first step in that direction and must be made bearing in mind Occam's razor, which in this case means using simple functions with as few parameters as possible. Theoretical assumptions of an economic nature must be built into the differential equations and not into the empirical curves. Therefore, the relationships among the parameters should be a result of the dynamics of the model determined by the solutions of the differential equations. Using as many parameters as necessary in complicated functions which do not originate from some sort of dynamical analysis is not a promising approach to the income distribution problem because it will make the task of finding the underlying differential equations even more difficult, if not impossible. Perhaps this is one of the reasons why the approach of conventional mainstream economics to the personal income distribution problem has made little progress since Pareto's time towards developing a dynamical theory connecting personal income generation and economic growth in the sense of Sraffa [75], as pointed out by Gallegati et al. [31]. Thus, simple functions with as few parameters as possible which, at the same time, offer reasonable agreement with the data are certainly preferable.

The second point is that there is a tendency among a sizable 
number of economists of following an axiomatic and mathe- 
matically guided approach to their problems as opposed to the 
empirically guided paths usually taken by physicists. The major
trouble with approaching a problem guided almost exclusively
by logic is that this often leads to paradoxical situations, where 
it is possible to deductively arrive at apparently sound conclu- 
sions, which at the same time are entirely unsound empirically 
- here Aristotelian physics comes to mind as an example. The 
empirically sound path means starting and staying as close to 
the real data as possible when studying any problem of eco- 
nomic nature and avoiding as much as possible any kind of a 
priori assumption. This is especially true during the infancy of 
a new area of study. Examples of successful theories which did 
not follow this path are exceedingly rare, even within physics. 
That does not mean we dismiss the power of theoretical rea- 
soning, but even 20th century theoretical physics is strongly 
anchored upon very solid empirical foundations. For this rea- 
son we believe that research in econophysics must always care- 
fully consider the real data in order to avoid at all costs hypo- 
thetical, often anti-empirical, a priori assumptions. For econo- 
physics to succeed it must not repeat the fatal traps of con- 
ventional neoclassical economics, which is based on too many 
anti-empirical assumptions, all too often resulting in compromised
results [9, 29, 31, 42, 43, 44, 52, 54, 55, 56, 57, 63].
Fig. 1. Graph of the complementary cumulative probability of individual income $F(x)$ plotted against the normalized individual income $x$ for the month of September of each year in the time span of this study. Although Brazil experienced runaway inflation and hyperinflation from 1981 to 1993 the plots show a remarkable similarity during and after this period. The major differences stem from the plots of 1978 and 1979, just prior to Brazil's great inflationary period. Due to the absence of reliable digitized data before 1978 we were unable to ascertain whether or not these years were the last ones of a qualitatively different era regarding the income distribution in Brazil, which was then possibly terminated by the inflationary period.



As a third point, it was mentioned above that Dragulescu and Yakovenko [24, 25, 78] proposed an exponential type distribution of personal income analogous to the Boltzmann-Gibbs distribution of energy in statistical physics, motivated by the idea that "in a closed economic system money is conserved" [23]. Similarly, Chatterjee et al. [11] advanced an ideal-gas model of a closed economic system where total money and number of agents are fixed such that "no production or migration occurs and the only economic activity is confined to trading" [11]. Those results led to criticisms made by Gallegati et al. [31], who argued that industrialized economies are not a conservative system, meaning that "income is not, like energy in physics, conserved by economic processes". This occurs because, although transactions, that is, exchanges, are conservative, "capitalist economies are (...) characterized by economic growth. And growth occurs because production produces a net physical surplus". Ref. [31] concludes by stating that "models which focus purely on exchange and not on production cannot by definition offer a realistic description of the generation of income in the capitalist, industrialized economies".

Gallegati et al. [31] may have a point regarding the development of a dynamical theory of production. However, the focus of the approach made by physicists on the personal income distribution characterization problem has not been on this dynamical theory, which is obviously necessary but has not yet been developed. So far, econophysicists have been mainly focused on the more modest aim of finding good analytical descriptors of the individual income distribution, not only for the very rich, where the Pareto law is valid, but for the whole society. On this point the proposal of an exponential distribution is without any doubt a step forward, since it seems to produce good agreement with the data of some countries and is a simple function with one parameter only. Therefore, if the exponential function does not produce a good fit for the income data of Brazil (see below), we are entitled to ask whether or not it is possible to find another function with one, or two parameters at most, which could produce a good fit for the Brazilian data and, perhaps, could also be useful for fitting the income data of other countries.

As a final conceptual point, we should mention that in recent econophysics literature the words "income" and "wealth" have been used interchangeably. We believe this to be inappropriate. In this article income is used as a generic term for anything gained by an individual in a specific period of time, usually monthly or annually. It can be a wage, a pension, a government grant, or the revenue obtained from property or investment, such as rent or dividends. However, we believe that income should not be confused with wealth because, although these two concepts are related, wealth is the result of saved, or accumulated, income, often inherited. In other words, income is a flux, an inflow of value that an individual receives, or earns, in a specific time interval which, if accumulated, may become wealth. In turn, the investment of wealth in property, shares, etc., generates income as rent, dividends, etc. The empirical findings that led to the Pareto law were mostly derived from personal income data, although it appears reasonable to suspect that the personal wealth distribution should also follow a power law for those individuals with high wealth.

Fig. 2. Continuation of figure 1 showing the complementary cumulative individual income distribution $F(x)$ against the normalized individual income $x$ in Brazil from 1992 to 2005.

3.1 Basic Equations 

Let $\mathcal{F}(x)$ be the cumulative distribution function of individual income, or simply cumulative income distribution, which gives the probability that an individual receives an income less than or equal to $x$. It follows from this definition that the complementary cumulative income distribution $F(x)$ will then give the probability that an individual receives an income equal to or greater than $x$. Clearly $\mathcal{F}(x)$ and $F(x)$ are related by the following expression,

$$\mathcal{F}(x) + F(x) = 100, \qquad (2)$$

where we have assumed the maximum probability as being equal to 100%. If both $\mathcal{F}(x)$ and $F(x)$ are continuous and have continuous derivatives for all values of $x$, this means that,

$$\frac{d\mathcal{F}(x)}{dx} = f(x), \qquad \frac{dF(x)}{dx} = -f(x), \qquad (3)$$

and

$$\int_0^\infty f(x)\, dx = 100. \qquad (4)$$

Here $f(x)$ is the probability distribution function of individual income, defined such that $f(x)\,dx$ is the fraction of individuals with income between $x$ and $x + dx$. This function is also known as probability density, but from now on we will call it simply the probability income distribution. The equations above lead to the following results,

$$\mathcal{F}(x) - \mathcal{F}(0) = \int_0^x f(w)\, dw, \qquad (5)$$

$$F(x) - F(\infty) = \int_x^\infty f(w)\, dw. \qquad (6)$$

Fig. 3. The Lorenz curves for the individual income distribution of Brazil in the month of September of the respective year. The x-axis plots the % of individuals whereas the y-axis is the % of total income.

Although we found in our data a non-negligible number of individuals who earned nothing when the sampling was carried out, zero income values do not have a weight in the income distribution function and, therefore, it seems reasonable to assume those results to be of a transitional nature and dismiss them from our analysis by assigning zero probabilities. Similarly, the very rich comprise very few individuals, such that their probabilities tend to zero. Note, however, that these two situations are limiting cases and should only be considered as true within the uncertainties of our measurements. Therefore, it follows from this reasoning that the boundary conditions below should approximately apply to our problem,



$$\begin{cases} \mathcal{F}(0) = F(\infty) \cong 0, \\ \mathcal{F}(\infty) = F(0) \cong 100. \end{cases} \qquad (7)$$

3.2 Two Parts for the Income Distribution

As discussed above, our approach implies searching for simple functions to describe the income distribution. Therefore we shall divide this distribution into two distinct parts, one for the very rich and the other for the overwhelming majority of the population. To establish the notation, when divided that way the complementary cumulative distribution function of the individual income will be written as follows,

$$F(x) = \begin{cases} G(x), & (0 \le x < x_t), \\ P(x), & (x_t \le x < \infty), \end{cases} \qquad (8)$$

where $x_t$ is the income value marking the transition between the two components of the income distribution. Then the cumulative distribution will be given by,

$$\mathcal{F}(x) = \begin{cases} 100 - G(x), & (0 \le x < x_t), \\ 100 - P(x), & (x_t \le x < \infty), \end{cases} \qquad (9)$$

and the probability density yields,

$$f(x) = \begin{cases} g(x), & (0 \le x < x_t), \\ p(x), & (x_t \le x < \infty). \end{cases} \qquad (10)$$



3.3 The Pareto Law 

It is a well known empirical fact that the richest portion of many, perhaps most, populations follows a Pareto power law of the form,

$$P(x) = \beta\, x^{-\alpha}, \qquad (11)$$

where $\alpha$ and $\beta$ are positive constants. The parameter $\alpha$ is known as the Pareto index or, if we adopt the modern language of fractals [51, 52], the fractal dimension of the distribution. This law is valid only for the region of high personal income, starting at $x = x_t$ and going up to the maximum value obtained in the observed dataset. As we shall show below, our data presents compelling evidence that the Pareto law is valid in Brazil.

Fig. 4. Continuation of figure 3 showing the Lorenz curves of the income distribution in Brazil from 1992 to 2005.

Fig. 5. This figure shows the evolution of the Brazilian Gini coefficient for most of the last three decades. The values shown in this plot are presented in table g]

It is well known that if the complementary cumulative distribution is a power law, the probability distribution is also a power law. Therefore, the Paretian part of the income distribution of the Brazilian population has a probability density given by the following expression,

$$p(x) = \alpha\, \beta\, x^{-(1+\alpha)}. \qquad (12)$$

It clearly follows from this equation that $p(\infty) = 0$.
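
Taking logarithms of the Pareto law gives $\ln P = \ln\beta - \alpha \ln x$, a straight line in a log-log plot. The sketch below recovers $\alpha$ and $\beta$ from that line by ordinary least squares. The helper names and the synthetic tail (with $\alpha = 2.5$ and $\beta = 30$) are illustrative assumptions only and are not values taken from the Brazilian data.

```python
import math


def linear_fit(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return my - slope * mx, slope


def fit_pareto(xs, ccdf):
    """Fit ln P = ln(beta) - alpha * ln(x); returns (alpha, beta)."""
    intercept, slope = linear_fit([math.log(x) for x in xs],
                                  [math.log(p) for p in ccdf])
    return -slope, math.exp(intercept)


# Synthetic tail on a logarithmic grid (illustrative parameter values).
xs = [8.0 * 1.1 ** k for k in range(20)]
ccdf = [30.0 * x ** -2.5 for x in xs]
alpha, beta = fit_pareto(xs, ccdf)
print(alpha, beta)
```

On real data the points scatter around the line, so the fitted slope and intercept would carry uncertainties that this sketch does not estimate.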

3.4 The Lower Income Region 

3.4.1 The Exponential 

The first obvious thing to do with our data in the lower income
region was to follow the proposal of Ref. [24] and try an exponential
fit. Surprisingly, however, the results were not good.
The semi-log plot clearly did not linearize our data, something 
that could only be achieved by removing the values due to very 
low income. Figures [6] and [7] show plots where we have at- 
tempted to fit the exponential to the Brazilian data and a simple 
visual inspection shows the inadequacy of this function to de- 
scribe the observed data points. Since other functions like the 
Gaussian or the Boltzmann-Gibbs are also derived from the ex- 
ponential, these graphs were enough to convince us to dismiss 
all functions based on a simple exponential as viable fits for the 
Brazilian data. We then started searching for other ways of rep- 
resenting the Brazilian income distribution, especially at very 
low income values. 

3.4.2 The Gompertz Curve 

In the process of searching for a simple function capable of rep- 
resenting our dataset we realized that the plot itself suggested 
taking the second logarithm of the complementary cumulative 
distribution. When doing so the data tended to follow a straight 
line, a result which immediately suggested adopting the Gom- 
pertz curve IPTTI to model the complementary cumulative in- 
come distribution of Brazil. This curve may be written as fol- 
lows, 

$$G(x) = e^{e^{(A - Bx)}}, \qquad (13)$$

where $A$ and $B$ are positive constants. Section 5 below presents
further discussions about this function.

The definition of the cumulative distribution and its complement
allows us to find the Gompertzian probability density of the income
distribution of the Brazilian population. It can be written
as follows,

$$g(x) = B\, e^{(A - Bx)}\, e^{e^{(A - Bx)}}. \qquad (14)$$
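The relation between the Gompertz curve (13) and its density (14), $g(x) = -dG/dx$, can be checked numerically. The following is a minimal sketch, not part of the original analysis: the values of `A` and `B` are illustrative choices of the order of the fitted parameters reported in this paper, and the step `h` and the sample incomes are arbitrary assumptions.

```python
import math

# Illustrative values, of the order of the fitted parameters reported in this paper.
A, B = 1.54, 0.34

def G(x):
    """Gompertz complementary cumulative distribution, equation (13)."""
    return math.exp(math.exp(A - B * x))

def g(x):
    """Gompertzian probability density, equation (14)."""
    return B * math.exp(A - B * x) * math.exp(math.exp(A - B * x))

def minus_dG_dx(x, h=1e-5):
    """Central finite-difference estimate of -dG/dx (h is an arbitrary small step)."""
    return -(G(x + h) - G(x - h)) / (2.0 * h)

# Largest relative mismatch between equation (14) and the numerical derivative
# over a few arbitrary normalized incomes.
sample_incomes = [0.0, 0.5, 1.0, 2.0, 5.0, 7.5]
max_rel_err = max(abs(minus_dG_dx(x) - g(x)) / g(x) for x in sample_incomes)
print(f"largest relative mismatch: {max_rel_err:.1e}")
```

The mismatch should be at the level of the finite-difference error, which indicates that (14) is indeed minus the derivative of (13).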



Therefore, as mentioned above, in what follows it will become
clear that our data present compelling evidence that the
complementary cumulative individual income distribution in
Brazil has two distinct components, represented by a Gompertz
curve and the Pareto power law. As in other countries, this
characterizes Brazil as having a well defined two class system
as far as individual income is concerned.

Both equations (11) and (13) can be linearized and, therefore,
the unknown parameters can be obtained by linear data fitting.
However, the boundary conditions allow us to find the theoretical
value for $A$ and $g(0)$. With $F(0) = 100$, these results may be
written as follows,

$$e^{e^{A}} = G(0) \;\Rightarrow\; A = \ln\{\ln[F(0)]\} = 1.53, \qquad (15)$$

$$g(0) = 461\, B. \qquad (16)$$

The equations above are just different ways of expressing the
boundary condition due to the zero income individuals data. The
fitting should produce values for $A$ which will probably fluctuate
around its theoretical result above. Finding the extent of these
fluctuations is one of our goals, since they should indicate how
far the approximations adopted in our modeling are valid.
Nevertheless, it is an advantageous feature of our modeling to
know beforehand one of the four parameters. As we shall see
below, $\beta$ can be determined by either data fitting or
normalization, a fact which effectively leaves only two parameters,
$\alpha$ and $B$, to be determined entirely by data fitting.
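The numerical values in equations (15) and (16) follow directly from the zero income boundary condition when the distribution is expressed in percentages. The short sketch below reproduces them; the variable and function names are hypothetical, and `F0 = 100` is the only input.

```python
import math

F0 = 100.0  # F(0): the whole population, in percent, has income >= 0

# Equation (15): exp(exp(A)) = G(0) = F(0)
A = math.log(math.log(F0))

def gompertz_density_at_zero(B):
    """Equation (14) evaluated at x = 0: g(0) = B * exp(A) * exp(exp(A))."""
    return B * math.exp(A) * math.exp(math.exp(A))

print(f"A = {A:.2f}")                                    # A = 1.53
print(f"g(0) / B = {gompertz_density_at_zero(1.0):.0f}")  # g(0) / B = 461
```

The factor 461 is simply $100 \ln 100$ rounded to the nearest integer.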



3.5 Continuity Across the Gompertz-Pareto Regions 

It is desirable to investigate whether or not the cumulative
income distribution remains continuous across the transition
between the Gompertz and Pareto regions. For this continuity
to occur all parameters should obey the constraint equation
$G(x_t) = P(x_t)$, that is,

$$e^{e^{(A - Bx_t)}} = \beta\, x_t^{-\alpha}. \qquad (17)$$

In addition, should the usual normalization of the probability
distributions between the two regions hold, the following
condition will need to be satisfied,

$$\int_0^{\infty} f(x)\, dx = \int_0^{x_t} B\, e^{(A - Bx)}\, e^{e^{(A - Bx)}}\, dx + \int_{x_t}^{\infty} \alpha\, \beta\, x^{-(1+\alpha)}\, dx = 100. \qquad (18)$$

It is straightforward to show that the normalization above,
together with the boundary conditions, leads to the same
constraint equation (17): the first integral equals
$G(0) - G(x_t) = 100 - G(x_t)$ and the second equals $P(x_t)$.
It is also simple to verify that the constraint equation above
can be solved once $\alpha$, $\beta$ and $B$ are determined by fitting,
although finding $x_t$ from equation (17) requires the use of
numerical methods. Nevertheless, our preference is to determine
$x_t$ directly from the observed data, leaving the remaining
parameters to be obtained by a mixture of data fitting
and normalization.
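As an illustration of the numerical route that we do not follow, the constraint $G(x_t) = P(x_t)$ can be solved for $x_t$ by bisection on the logarithm of both sides. This is only a sketch under stated assumptions: the parameter values are hypothetical, the search bracket `[lo, hi]` is an arbitrary choice, and $\beta$ is generated from a known $x_t$ so that the root can be verified.

```python
import math

def beta_from_continuity(A, B, alpha, x_t):
    """Continuity constraint G(x_t) = P(x_t) rearranged for beta."""
    return x_t ** alpha * math.exp(math.exp(A - B * x_t))

def continuity_gap(x, A, B, alpha, beta):
    """ln G(x) - ln P(x); it vanishes where the continuity constraint holds."""
    return math.exp(A - B * x) - (math.log(beta) - alpha * math.log(x))

def solve_transition_income(A, B, alpha, beta, lo=0.1, hi=100.0, tol=1e-10):
    """Bisection on [lo, hi]; the bracket is an assumption of this sketch."""
    f_lo = continuity_gap(lo, A, B, alpha, beta)
    f_hi = continuity_gap(hi, A, B, alpha, beta)
    if f_lo * f_hi > 0:
        raise ValueError("no sign change in the bracket; widen [lo, hi]")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = continuity_gap(mid, A, B, alpha, beta)
        if f_lo * f_mid <= 0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)

# Hypothetical parameter values, chosen only to exercise the solver.
A, B, alpha = 1.54, 0.34, 2.8
x_t_true = 7.5
beta = beta_from_continuity(A, B, alpha, x_t_true)
x_t = solve_transition_income(A, B, alpha, beta)
print(f"recovered x_t = {x_t:.6f}")
```

Bisection is used here only because it needs no derivative and cannot leave the bracket; any standard root finder would serve equally well.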



3.6 Exponential Approximation of the Gompertz Curve 

We can derive a convenient approximation for the Gompertz
curve (13) when it nears the Pareto region, i.e., for large values
of $x$. In this case the term $Bx$ dominates over the parameter $A$
and equation (13) reduces to $G(x) \approx e^{e^{-Bx}}$. If we now define a
new variable $z = e^{-Bx}$, then large values of $x$ imply small values
of $z$ and the following Taylor expansion holds:

$$e^{z} = 1 + z + z^2/2 + z^3/6 + \ldots \quad (z < 1). \qquad (19)$$

Fig. 6. These graphs show the exponential fit for the lower region of the income distribution. Clearly the exponential is not a good representation
for the Brazilian income data.



In view of this we may write the following approximation,

$$G(x) = e^{e^{(A - Bx)}} \approx 1 + e^{-Bx} \quad (\text{for } Bx \gg A \text{ and } e^{-Bx} < 1). \qquad (20)$$

This result means that the Gompertz curve reduces to the
exponential function when the personal income $x$ is large enough.
It also means that the Gompertz curve allows us to have one
of its parameters as a boundary condition for the zero income
situation at the same time as having an exponential feature for
larger incomes. In addition, the probability income distribution
as given by equation (14) can also be similarly approximated,
yielding,

$$g(x) = B\, e^{(A - Bx)}\, e^{e^{(A - Bx)}} \approx B\, e^{-Bx} \quad (\text{for } Bx \gg A \text{ and } e^{-Bx} < 1). \qquad (21)$$



Note that the approximation above means leaving the very 
low income data out of our analysis, which in turn reduces 
our problem to the exponential fit, as proposed in Ref. [24].
A simple visual inspection of figures [6] and [7] shows that the 
data seems to be fairly represented by an exponential if we re- 
move the very low income dataset (x < 2). This feature may 
explain why the exponential is such a poor representation of 
our income data. Brazil is notoriously a very unequal country 
in terms of income distribution and, therefore, our data tend to 
clump towards low income values. 



Finally, the approximations (20) and (21) also mean that
the exponential and the Gompertz curve are not very dissimilar
to one another in terms of being good representations of the
non-Paretian part of the individual income distribution. So, the
case for the Gompertz curve is made on the grounds of a better
data fit, especially considering the very low income values that
are strongly represented in the Brazilian income dataset, and
its possible interpretation as a growth curve in the context of
attempting to connect personal income with industrial production
and economic growth (see Section 5 below).



3.7 Average Income 

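The mean income of the whole population may be written as
follows,

$$\langle x \rangle = \frac{\int_0^{\infty} x f(x)\, dx}{\int_0^{\infty} f(x)\, dx} = \frac{1}{100} \left[ \int_0^{x_t} x\, B\, e^{(A - Bx)}\, e^{e^{(A - Bx)}}\, dx + \lim_{x_{max} \to \infty} \int_{x_t}^{x_{max}} x\, \alpha\, \beta\, x^{-(1+\alpha)}\, dx \right]. \qquad (22)$$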



The solution of the last integral on the right hand side yields,

$$\lim_{x_{max} \to \infty} \int_{x_t}^{x_{max}} x\, \alpha\, \beta\, x^{-(1+\alpha)}\, dx = \frac{\alpha \beta}{(1 - \alpha)} \lim_{x_{max} \to \infty} \left[ x_{max}^{(1-\alpha)} - x_t^{(1-\alpha)} \right]. \qquad (23)$$

Fig. 7. Continuation of figure 6 showing how poor a representation the exponential is for the complementary cumulative income distribution data
in Brazil.



Clearly this limit will only converge if the Pareto index is
bigger than one. Possible non finite averages may happen with
power laws as discussed in Ref. [62]. Indeed, datasets of finite
sizes will produce a finite average since we can take $x_{max}$
as being the maximum dataset value and cut off this integral
above some upper limit. Nevertheless, this is not the case of
income distribution because, although there are extremely rich
individuals, if we make more measurements and generate a
larger dataset we will eventually reach a value of $x_{max}$ such
that the chance of getting an even larger value will indeed become
zero, since even super-rich individuals do not receive an infinite
income and their numbers are finite. In other words, as we
go to larger and larger individual income datasets our estimate
of $\langle x \rangle$ will not increase without bound. We therefore can
conclude that the condition $\alpha > 1$ is an empirically necessary
requirement for the Pareto law to hold, which is just another way
of stating that the boundary condition $F(\infty) = 0$ is empirically
sound. In such a case equation (22) reduces to an expression
which may be written as below,

$$\langle x \rangle = \frac{1}{100} \left[ I(x_t) + \frac{\alpha \beta}{(\alpha - 1)}\, x_t^{(1-\alpha)} \right] \quad (\text{for } \alpha > 1), \qquad (24)$$

where $I(x)$ is given by the following, numerically solvable,
integral,

$$I(x) = \int_0^{x} w\, g(w)\, dw = \int_0^{x} w\, B\, e^{(A - Bw)}\, e^{e^{(A - Bw)}}\, dw. \qquad (25)$$
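The average income can be evaluated by integrating the Gompertzian part numerically and adding the closed form of the Pareto tail. The sketch below uses a composite Simpson rule as one possible quadrature; the function names, the number of subintervals and all parameter values are assumptions made for illustration, with $\beta$ fixed by continuity at $x_t$ so that the normalization to 100 holds.

```python
import math

def gompertz_density(w, A, B):
    """Gompertzian probability density g(w)."""
    return B * math.exp(A - B * w) * math.exp(math.exp(A - B * w))

def simpson(f, a, b, n=2000):
    """Composite Simpson rule with an even number n of subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0

def pareto_tail_moment(alpha, beta, x_t):
    """Closed form of the Pareto contribution to the mean; requires alpha > 1."""
    if alpha <= 1.0:
        raise ValueError("the tail integral diverges for alpha <= 1")
    return alpha * beta / (alpha - 1.0) * x_t ** (1.0 - alpha)

def mean_income(A, B, alpha, beta, x_t):
    """Average income: Gompertzian integral I(x_t) plus Pareto tail, over 100."""
    I_xt = simpson(lambda w: w * gompertz_density(w, A, B), 0.0, x_t)
    return (I_xt + pareto_tail_moment(alpha, beta, x_t)) / 100.0

# Hypothetical parameter values; A is fixed by the zero income boundary condition.
A = math.log(math.log(100.0))
B, alpha, x_t = 0.34, 2.8, 7.5
beta = x_t ** alpha * math.exp(math.exp(A - B * x_t))  # continuity at x_t
print(f"<x> = {mean_income(A, B, alpha, beta, x_t):.3f}")
```

Calling `mean_income` with `alpha <= 1` raises an error instead of returning a number, mirroring the convergence condition discussed above.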



4 Results 

4.1 Parameters of the Gompertz Curve 

To determine $A$ and $B$ we carried out a least squares fit since in
this region the dataset does not exhibit large fluctuations which
can cause large fitting bias, as discussed in Goldstein et al.
(2004). However, to do so we first need to find $x_{gmax}$, that is,
the maximum value of $x$ that marks the end of the Gompertz
region. The boundary conditions and equation (13) imply $A = 1.53$
and, therefore, we assumed that the end of the Gompertz region
is reached when a value for $x$ is found such that the straight
line fit of $\{\ln[\ln G(x)]\}$ produces $A = 1.5 \pm 0.1$. By following
this methodology we were able to determine the specific value
of $x_{gmax}$ for our dataset and fit the Gompertz curve. Plots are
shown in figures 8 and 9 and the results are summarized in
table 2, where one can verify that the result $A = 1.54 \pm 0.03$
encompasses the whole period under study, that is, from 1978
to 2005. Hence, in the time period of our analysis $A$ varies no
more than 2.6% from its boundary value given in equation (15).

Fig. 8. Plots showing the fit of the Gompertz curve to Brazil's individual income distribution data. The y-axis is the double logarithm of the
cumulative distribution, that is, $\{\ln[\ln F]\}$, whereas the x-axis is the normalized individual income $x$ up to the value where $A \approx 1.5$. The dashed
line is the fitted straight line. Clearly the fit is good up to very small values of $x$, a result which lends support to the Gompertz curve
as a good model for the income distribution of the economically less favored individuals in the Brazilian population. Values of the parameters
resulting from the fit are presented in table 2.



Regarding the other parameter, the results are also stable from
1981 to 2005. However, $B$ was found to be higher in 1978 and
1979, a result which is probably related to the fact that in these
years the income distribution behaves differently (see the
caption of figure 1).
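The fitting procedure described in this section can be sketched as a straight line fit of $\ln[\ln G(x)]$ against $x$, whose intercept is $A$ and whose slope is $-B$, with uncertainties estimated by bootstrap resampling with replacement. The code below is a minimal illustration and not the code used for this work: the data are synthetic, generated from a Gompertz curve with assumed parameters and a small multiplicative noise, and the function names are hypothetical.

```python
import math
import random
import statistics

def fit_gompertz(xs, Gs):
    """Least squares line through (x, ln ln G); returns (A, B) = (intercept, -slope)."""
    ys = [math.log(math.log(G)) for G in Gs]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, -slope

def bootstrap_errors(xs, Gs, n_resamples=1000, seed=0):
    """Standard deviations of A and B over resamples drawn with replacement."""
    rng = random.Random(seed)
    pairs = list(zip(xs, Gs))
    A_values, B_values = [], []
    for _ in range(n_resamples):
        resample = [rng.choice(pairs) for _ in pairs]
        rx = [p[0] for p in resample]
        rG = [p[1] for p in resample]
        if len(set(rx)) < 2:  # a line needs at least two distinct abscissas
            continue
        A_fit, B_fit = fit_gompertz(rx, rG)
        A_values.append(A_fit)
        B_values.append(B_fit)
    return statistics.stdev(A_values), statistics.stdev(B_values)

# Synthetic data: assumed parameters, 0.5% multiplicative noise (illustration only).
noise = random.Random(42)
A_true, B_true = 1.53, 0.34
xs = [0.25 * i for i in range(30)]
Gs = [math.exp(math.exp(A_true - B_true * x)) * (1.0 + noise.uniform(-0.005, 0.005))
      for x in xs]

A_fit, B_fit = fit_gompertz(xs, Gs)
dA, dB = bootstrap_errors(xs, Gs)
print(f"A = {A_fit:.2f} +/- {dA:.2f}, B = {B_fit:.2f} +/- {dB:.2f}")
```

The double logarithm is only defined for $G(x) > 1$, which always holds for the Gompertz curve with the positive sign adopted here.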



4.2 Parameters for the Pareto Law 



To fit the Pareto law we need to determine $x_{pmin}$, that is, the
minimum value of $x$ that marks the start of the Paretian part
of the income distribution. In most years our data clearly
indicated that $x_{pmin}$ ought to be equal to $x_{gmax}$. Nevertheless,
due to the previously discussed anomaly of the income distribution
in 1978 and 1979, the data for these years showed that
$x_{pmin} > x_{gmax}$. Inasmuch as from their definitions it is obvious
that $x_{gmax} \le x_t \le x_{pmin}$, for 1978 and 1979 the transition
incomes between the Gompertzian and Paretian regions and their
uncertainties are evaluated as follows,

$$x_t = \frac{1}{2} \left( x_{pmin} + x_{gmax} \right), \qquad \delta x_t = \frac{1}{2} \left( x_{pmin} - x_{gmax} \right). \qquad (26)$$

Clearly if $x_{gmax} = x_{pmin}$, then $x_t = x_{pmin}$ and $\delta x_t = 0$. These
quantities were then calculated in our dataset and the results
are presented in table 3.
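The rule in equation (26) amounts to taking the midpoint of the gap between the two regions as the transition income and half the gap as its uncertainty. A small sketch follows; the function name is hypothetical, the value of `x_gmax` is the 1978 entry of table 2, and the value of `x_pmin` is an invented number used only for illustration.

```python
def transition_income(x_gmax, x_pmin):
    """Equation (26): midpoint and half-width of the interval [x_gmax, x_pmin]."""
    if x_pmin < x_gmax:
        raise ValueError("x_pmin must not be smaller than x_gmax")
    x_t = 0.5 * (x_pmin + x_gmax)
    delta_x_t = 0.5 * (x_pmin - x_gmax)
    return x_t, delta_x_t

# x_gmax for 1978 as listed in table 2; x_pmin = 9.0 is a hypothetical value.
x_t, delta_x_t = transition_income(6.606, 9.0)
print(f"x_t = {x_t:.3f} +/- {delta_x_t:.3f}")

# When both regions meet at the same income there is no uncertainty.
print(transition_income(7.4, 7.4))
```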

The parameters $\alpha$ and $\beta$ were evaluated by two different
methodologies, least squares fitting and maximum likelihood
estimate. Details of both methods and comparison of the results
are described in what follows.



4.2.1 Least Squares Fitting 

This fitting method is not recommended when the data shows
large fluctuations, unless some binning process is employed
such that these fluctuations are severely reduced. As discussed
in Section 2 our data was treated that way and, therefore, we
believe that presenting the Pareto law parameters obtained by
least squares fitting (LSF) is useful, especially in order to
compare them with the other fitting method described below.

Figures 10 and 11 show the tail of the complementary cumulative
distribution where one can clearly identify the power
law decay in the data plots of all years. These figures also show
the straight line fitted by least squares. Table 3 presents the
values of the parameters found by LSF. One can clearly notice
that both parameters of the Pareto law do not remain as stable
as the parameters of the Gompertz curve during the time span
of our analysis.

Fig. 9. Continuation of figure 8 showing the fit of the Gompertz curve for data from 1992 to 2005.



We must note that Cowell et al. [16] have previously presented
evidence for the Pareto law in the Brazilian individual
income distribution. Nonetheless, their study was restricted to
the 1981-1990 period, shorter than the one considered here, and,
even so, they only analyzed data for three years: 1981, 1985
and 1990. They assumed a log-normal distribution for the region
of lower income, but found out later that a Gaussian distribution
does not fit the data well. They also took the unusual
step of dividing the Pareto tail into two income range regions,
one for the rich and the other for the very rich, without presenting
an adequate justification for such a procedure, but reaching
conclusions about the "increased inequality amongst the very
rich." This seems particularly odd if we bear in mind that the
income region of the very rich is exactly where we have the
least data and the statistical fluctuations are at their highest. As
stated above, here we present a study with a larger time span
and which includes all available data in the specified period,
totaling 24 yearly samples. We also advance the Gompertz curve
as a good descriptor for the lower individual income population
and found no evidence to support the claim made by Ref. [16]
of such two Paretian components. On the contrary, our data
showed very clearly a well defined and unique Pareto tail in all
samples.



4.2.2 Maximum Likelihood Estimation 

This method is considered a better way of finding the Pareto
index because it deals well with the statistical fluctuations found
in the tails of income distributions. Here we shall closely follow
the approach proposed by Ref. [62] to derive the likelihood
of our dataset.

The constant $\beta$ is obtained as a result of the normalization
requirement (18). As seen above, this normalization is equivalent
to the constraint equation (17). Hence,

$$\beta = x_t^{\alpha}\, e^{e^{(A - Bx_t)}}. \qquad (27)$$

This expression can be substituted into the probability density
(12), yielding,

$$p(x) = \alpha\, x_t^{\alpha}\, e^{e^{(A - Bx_t)}}\, x^{-(1+\alpha)}. \qquad (28)$$

The likelihood of the data set is given by,

$$P(x|\alpha) = \prod_{j=1}^{n} p(x_j) = \prod_{j=1}^{n} \alpha\, x_t^{\alpha}\, e^{e^{(A - Bx_t)}}\, x_j^{-(1+\alpha)}. \qquad (29)$$

We can calculate the most likely value of $\alpha$ by maximizing the
likelihood with respect to $\alpha$, which is the same as maximizing
the logarithm of the likelihood, denoted as $\mathcal{L}$. Such calculation
Table 2. Results of fitting the Gompertz curve to Brazil's income distribution data from 1978 to 2005. The parameters were obtained by least-squares
fitting and the respective errors by means of one thousand bootstrap resamples with replacement such that an average fitting and standard
deviation can be obtained for each sample in order to estimate the uncertainties. Note the very good values of the correlation coefficients of
the fitting. Since the Gompertz parameters do not change very much, we were able to reach an estimation valid for the whole period of our
analysis. That yields A = (1.54 ± 0.03), B = (0.39 ± 0.08). Similarly, the end of the Gompertz region is given by x_gmax = (7.4 ± 0.8).

year   A             B             x_gmax   correlation coeff.   % of individuals in Gompertz region
1978   1.52 ± 0.01   0.46 ± 0.01   6.606    0.997                98.9
1979   1.54 ± 0.01   0.44 ± 0.01   6.920    0.997                98.9
1981   1.55 ± 0.01   0.34 ± 0.02   7.533    0.992                98.9
1982   1.55 ± 0.01   0.34 ± 0.02   7.473    0.993                98.9
1983   1.54 ± 0.01   0.33 ± 0.01   6.910    0.996                98.7
1984   1.55 ± 0.01   0.33 ± 0.01   7.388    0.994                98.9
1985   1.54 ± 0.01   0.33 ± 0.01   7.490    0.996                98.9
1986   1.55 ± 0.01   0.34 ± 0.01   7.112    0.995                98.8
1987   1.55 ± 0.01   0.34 ± 0.02   7.626    0.992                98.9
1988   1.54 ± 0.01   0.32 ± 0.02   8.140    0.992                98.9
1989   1.53 ± 0.01   0.32 ± 0.01   7.856    0.995                98.8
1990   1.54 ± 0.01   0.34 ± 0.02   8.074    0.991                98.9
1992   1.56 ± 0.01   0.36 ± 0.02   7.635    0.989                99.0
1993   1.54 ± 0.01   0.33 ± 0.01   7.674    0.997                98.8
1995   1.54 ± 0.01   0.33 ± 0.01   7.887    0.995                98.9
1996   1.55 ± 0.01   0.35 ± 0.02   8.163    0.989                99.0
1997   1.55 ± 0.01   0.34 ± 0.02   7.935    0.992                99.0
1998   1.54 ± 0.01   0.33 ± 0.01   7.628    0.997                98.8
1999   1.54 ± 0.01   0.33 ± 0.01   7.811    0.994                98.9
2001   1.54 ± 0.01   0.34 ± 0.01   7.774    0.996                98.9
2002   1.55 ± 0.01   0.34 ± 0.02   7.878    0.993                99.0
2003   1.54 ± 0.01   0.33 ± 0.01   7.374    0.997                98.8
2004   1.55 ± 0.01   0.34 ± 0.02   7.653    0.993                98.9
2005   1.54 ± 0.01   0.33 ± 0.01   7.403    0.997                98.8

leads us to the following results,

$$\mathcal{L} = \ln P(x|\alpha) = \sum_{j=1}^{n} \left[ \ln \alpha + \alpha \ln x_t + e^{(A - Bx_t)} - (1 + \alpha) \ln x_j \right] = n \ln \alpha + n \alpha \ln x_t + n\, e^{(A - Bx_t)} - (1 + \alpha) \sum_{j=1}^{n} \ln x_j. \qquad (30)$$

Setting $\partial \mathcal{L} / \partial \alpha = 0$, we find,

$$\alpha = n \left[ \sum_{j=1}^{n} \ln \frac{x_j}{x_t} \right]^{-1}. \qquad (31)$$

Apart from a slight notation change, this result is equal to
equation (B6) in Ref. [62], despite the fact that this work adopts a
different normalization, as can be seen when comparing
equation (27) above to equation (9) of Ref. [62]. Therefore, this
change in normalization does not affect the estimation of the
exponent of the Pareto law obtained by the maximum likelihood
estimator (MLE).

To find the expected error in the estimation of $\alpha$, the width
of the maximum of the likelihood as a function of $\alpha$ should
provide us with an estimate of $\delta\alpha$. Taking the exponential of
equation (30) allows us to find the likelihood as follows,

$$P(x|\alpha) = a\, e^{-b(1+\alpha)}\, \alpha^{n}\, x_t^{n\alpha}, \qquad (32)$$

where

$$a = e^{n e^{(A - Bx_t)}}, \qquad (33)$$

$$b = \sum_{j=1}^{n} \ln x_j. \qquad (34)$$

Remembering that $\alpha > 1$, the square root of the variance in $\alpha$
will give us $\delta\alpha$. Therefore, we have that,

$$\delta\alpha = \sqrt{\langle \alpha^2 \rangle - \langle \alpha \rangle^2}, \qquad (35)$$

where

$$\langle \alpha \rangle = \frac{\int_1^{\infty} e^{-b(1+\alpha)}\, \alpha^{(1+n)}\, x_t^{n\alpha}\, d\alpha}{\int_1^{\infty} e^{-b(1+\alpha)}\, \alpha^{n}\, x_t^{n\alpha}\, d\alpha}, \qquad (36)$$

and

$$\langle \alpha^2 \rangle = \frac{\int_1^{\infty} e^{-b(1+\alpha)}\, \alpha^{(2+n)}\, x_t^{n\alpha}\, d\alpha}{\int_1^{\infty} e^{-b(1+\alpha)}\, \alpha^{n}\, x_t^{n\alpha}\, d\alpha}. \qquad (37)$$

Note that these two integrals can be solved numerically and that
both $x_j$ and $n$ in equations (31), (34), (36) and (37) refer only
to the observed normalized income values within the Pareto
region, that is, $x_j > x_t$.

After calculating $\delta\alpha$, finding $\delta\beta$ becomes just a matter of
using standard error propagation techniques in equation (27).
Table 3 and figures 12 and 13 present the results of the Pareto
law parameters obtained with the MLE.
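The maximum likelihood procedure of this subsection can be sketched numerically as follows. This is an illustration under stated assumptions, not the code used for this work: the incomes are synthetic Pareto quantiles generated with an assumed index, the function names are hypothetical, and the moment integrals are evaluated by a trapezoidal rule on a finite grid in $\alpha$. The sketch uses the fact that, up to a constant factor which cancels in the ratios, the likelihood as a function of $\alpha$ is proportional to $\alpha^{n} e^{-\alpha s}$ with $s = \sum_j \ln(x_j / x_t)$.

```python
import math

def pareto_index_mle(incomes, x_t):
    """MLE of the Pareto index from incomes above x_t; returns (alpha, n, s)."""
    tail = [x for x in incomes if x > x_t]
    n = len(tail)
    s = sum(math.log(x / x_t) for x in tail)
    return n / s, n, s

def pareto_index_error(n, s, steps=20000):
    """delta_alpha = sqrt(<alpha^2> - <alpha>^2) with weight alpha^n * exp(-alpha * s).

    The integrals run from alpha = 1 up to a cutoff 12 widths above the maximum
    (the cutoff and the grid size are assumptions of this sketch). Weights are
    computed in log space to avoid overflow for large n.
    """
    alpha_hat = n / s
    lo = 1.0
    hi = alpha_hat + 12.0 * math.sqrt(n + 1) / s
    h = (hi - lo) / steps

    def log_weight(a):
        return n * math.log(a) - a * s

    ref = log_weight(alpha_hat)  # maximum of the log weight
    m0 = m1 = m2 = 0.0
    for i in range(steps + 1):
        a = lo + i * h
        w = math.exp(log_weight(a) - ref)
        if i == 0 or i == steps:
            w *= 0.5  # trapezoidal end points
        m0 += w
        m1 += a * w
        m2 += a * a * w
    mean = m1 / m0
    return math.sqrt(m2 / m0 - mean * mean)

# Synthetic tail: 500 equally spaced Pareto quantiles with an assumed index of 2.8.
x_t, alpha_true, n_points = 7.5, 2.8, 500
incomes = [x_t * (1.0 - (j - 0.5) / n_points) ** (-1.0 / alpha_true)
           for j in range(1, n_points + 1)]

alpha_hat, n, s = pareto_index_mle(incomes, x_t)
delta_alpha = pareto_index_error(n, s)
print(f"alpha = {alpha_hat:.2f} +/- {delta_alpha:.2f}")
```

For large $n$ the width obtained this way should be close to $\sqrt{n+1}/s$, which gives a quick consistency check on the numerical integration.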

4.3 Percentage Populations and Percentage Share 

Once the Gompertzian and Paretian regions were established,
we were able to find the percentage of the population in each
component. The results shown in tables 2 and 3 allowed us
to determine that from 1978 to 2005 the income region described
by a Gompertz curve includes (98.85 ± 0.15)% of the
population of Brazil, whereas the Pareto region includes only
(0.85 ± 0.45)% of the Brazilian population. These results are
similar to the findings of Ref. [24] for the USA, showing that
Brazil also has a two class income system where the
overwhelming majority of the population belongs to the lower
income class.

It is of interest to obtain the percentage share of each of the 
two income components analyzed in this paper relative to the 
total income. Table [4] presents these results together with the 
Gini coefficients shown in figure 5. Due to the same reasons
discussed above the data analysis for 1978 and 1979 is prob- 
lematic because there is a large uncertainty in the transition 
income x, between the Gompertzian and Paretian regions (see 
table [3]). Figure [14] shows the percentage share of the Pareto 
region and in this figure the uncertainties for 1978 and 1979 
appear as large error bars for the first two points. If we dis- 
miss these two points, after a careful look at the irregular curve 
formed by the variations of the Paretian percentage share we 
can see that there is an oscillatory pattern, although with chang- 
ing amplitudes, whose periods can be set as roughly 4 years. 
The maximum and minimum inflexion points seem to alternate 
at approximately every 2 years. 

It is interesting to know whether or not there is any possible 
correlation of this approximate cycling pattern with any other 
economic quantity. Figure [15] presents a plot of the gross do- 
mestic product (GDP) growth of Brazil in the same time period 
of figure [14] and, although we can also identify an approximate 
cycling pattern in this graph, its oscillation does not seem to 
correlate with the cycles in the percentage share of the Pareto 
region. 

As a final point, we should note that this approximate cycling
pattern in the Paretian share could be consistent with a
purely deterministic dynamical model based on the application
of the Lotka-Volterra equation to economic growth and cycle as
advanced long ago by Goodwin [37]. Although such a model
predicts a very regular oscillation of the percentage share of
the lower income class, this discrepancy with our data could
perhaps be remedied by the introduction of perturbation
techniques. We shall not pursue this issue further here [61].



5 Conclusion 

In this paper we have carried out an analysis of the personal
income distribution in Brazil from 1978 to 2005. We have made
use of the extensive household data surveys collected and made
digitally available by the Brazilian Institute for Geography and
Statistics (IBGE) in order to obtain 24 yearly samples of the
complementary cumulative distribution function $F(x)$ of
individual income of Brazil in terms of the normalized personal
income $x$. We have concluded that this distribution function
is well described by two components. The first is a Gompertz
curve of the form $G(x) = \exp[\exp(A - Bx)]$, valid from $x = 0$
up to the transitional income $x_t$ and which includes
(98.85 ± 0.15)% of the population. The second component of the
complementary cumulative income distribution is a Pareto power
law $P(x) = \beta\, x^{-\alpha}$, valid from $x_t$ up. This includes the remaining
(0.85 ± 0.45)% of the population of Brazil. The positive
parameters $A$, $B$, $\alpha$ and $\beta$ were all determined by a mixture of
boundary conditions, normalization and data fitting in all 24 yearly
samples. We also estimated uncertainties for these parameters.
Lorenz curves and Gini coefficients were also obtained, as well
as the evolution of the percentage share of both components
relative to the total income. The Paretian and Gompertzian shares
show an approximate cycling pattern with periods of about 4
years and maximum and minimum peaks alternating at about
every 2 years. These results show that the income distribution
pattern emerging from the present study allows us to characterize
Brazil as being formed by a well defined two class system.

The challenging questions posed by the results of this work
concern the possible origins of the Gompertz curve. It seems
quite reasonable to suspect that the underlying dynamics of
income distribution should be intimately related to the dynamics
of production and economic growth in industrialized capitalist
economies. Since economic growth happens because production
produces a net physical surplus, the search for the origins
of the Gompertz curve in income distribution should perhaps
focus on growth because this curve has been successfully
applied in models of population dynamics, particularly human
mortality, from where it originated [70], population ecology
[45] and the growth of biomass [59]. So, the Gompertz curve
may provide an important clue connecting income distribution
and economic growth as a result of net production surplus. And
although in these applications the power of the first exponential
of the Gompertz curve is negative whereas here it has
a positive sign, such a difference may not be relevant to the
connection just mentioned. These remarks should also be true
for the logistic function, which shares with the Gompertz curve
the main feature of being S-shaped [45, 77] and also appears
in economic models. From a physicist's standpoint, it is well
known that the dynamics of complex systems gives rise to
fractal power law patterns similar to the Pareto law. So, patterns
in economic growth, viewed perhaps as a complex dynamical
system, could be the root cause giving rise to the Gompertzian
and Paretian income distribution functions.

We would like to express our gratitude to Humberto Lopes, Jose Luiz 
Louzada, Vera Duarte Magalhaes and Cristiano de Almeida Martins 
for their help with IBGE data. We are also grateful to two referees for 
Table 3. Results of fitting the Pareto law to Brazil's income data. The transition values from Gompertz to Pareto regions are also shown. Due to
the oddity of the data in 1978 and 1979 (see above), there are indeed uncertainties of $x_t$ in these years. Results of the parameters are presented by
both methods, least squares fitting (LSF) and maximum likelihood estimator (MLE). The correlation coefficient obtained with LSF is included,
as well as the percentage of the population in the Paretian region. $\delta\alpha$ and $\delta\beta$ in the LSF column were calculated as in the Gompertzian region,
that is, by bootstrap replacement. The MLE method results in more stable values of $\alpha$, whereas LSF results appear noisy.

Fig. 12. These graphs present the tail of the complementary cumulative distribution fitted by a Pareto power law whose exponents were obtained

Fig. 13. Continuation of figure 12 showing the Pareto power law fitted with the MLE from 1992 to 2005.

Table 4. This table presents the percentage share relative to the total of each of the two components that characterize the income distribution in
Brazil. The Gini coefficients from 1978 to 2005 plotted in figure 5 are also presented. By definition these coefficients are obtained as the area
in between the two curves in figures 3 and 4. The area below each of the Lorenz curves was estimated numerically.

Fig. 14. Plot showing the percentage share of the Pareto region relative to the total income. The error bars in 1978 and 1979 are due to the large
uncertainty of the transitional income value $x_t$ in these years, which led to huge uncertainties in the income share of both the Gompertz and
Pareto regions. Even if we dismiss the portion due to these two years, an irregular oscillatory pattern with changing amplitude is apparent
during the time span of our analysis. The maximum and minimum inflexion points seem to alternate roughly every 2 years, whereas the
period of this oscillation is approximately 4 years. If this oscillatory pattern is in fact a real feature of the income distribution
in Brazil, the next maximum of the Paretian income share should occur in 2005-2007, while the next minimum should happen in 2007-2009.
However, we must point out that this oscillatory pattern does not mean equilibrium. There was economic growth for most of the period shown
here, although the growth rate was at times fairly modest.

Fig. 15. GDP growth rate in Brazil from 1978 to 2005. Although this graph also shows an approximate cycling pattern, the oscillation shown
here does not seem to correlate with the cycle in the percentage share of the Paretian region presented in figure 14.



